learning and transfer
Why academic research in AI is a total waste of time
Jeremy Howard, a creator of fast.ai and an ex-President of Kaggle says that most of the research in the deep learning world is a total waste of time. He explains why it is so and what is currently being under studied i.e. active learning and transfer learning. Active learning and transfer learning are further elaborated in this blog post. When asked a question "what's wrong with Artificial Intelligence?", However, when you literally dig into the question, the industry of AI is fighting its own demons.
Hot papers on arXiv from the past month: August 2021
Reproduced under a CC BY 4.0 license. Here are the most tweeted papers that were uploaded onto arXiv during August 2021. Results are powered by Arxiv Sanity Preserver. How to avoid machine learning pitfalls: a guide for academic researchers Michael A. Lones Submitted to arXiv on: 5 August 2021 Abstract: This document gives a concise outline of some of the common mistakes that occur when using machine learning techniques, and what can be done to avoid them. It is intended primarily as a guide for research students, and focuses on issues that are of particular concern within academic research, such as the need to do rigorous comparisons and reach valid conclusions.
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Tirumala, Dhruva, Noh, Hyeonwoo, Galashov, Alexandre, Hasenclever, Leonard, Ahuja, Arun, Wayne, Greg, Pascanu, Razvan, Teh, Yee Whye, Heess, Nicolas
As reinforcement learning agents are tasked with solving more challenging and diverse tasks, the ability to incorporate prior knowledge into the learning system and to exploit reusable structure in solution space is likely to become increasingly important. The KL-regularized expected reward objective constitutes one possible tool to this end. It introduces an additional component, a default or prior behavior, which can be learned alongside the policy and as such partially transforms the reinforcement learning problem into one of behavior modelling. In this work we consider the implications of this framework in cases where both the policy and default behavior are augmented with latent variables. We discuss how the resulting hierarchical structures can be used to implement different inductive biases and how their modularity can benefit transfer. Empirically we find that they can lead to faster learning and transfer on a range of continuous control tasks.
- Europe > United Kingdom > England > Greater London > London (0.04)
- Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)
- Asia > Middle East > Jordan (0.04)
- Asia > India > West Bengal > Kolkata (0.04)